Elimination Forest Guided D Sparse LU Factorization

نویسندگان

  • Kai Shen
  • Xiangmin Jiao
  • Tao Yang
چکیده

Sparse LU factorization with partial pivoting is important for many scienti c applications and delivering high perfor mance for this problem is di cult on distributed memory machines Our previous work has developed an approach called S that incorporates static symbolic factorization supernode partitioning and graph scheduling This paper studies the properties of elimination forests and uses them to guide supernode partitioning amalgamation and execu tion scheduling The new design with D mapping e ec tively identi es dense structures without introducing too many zeros in the BLAS computation and exploits asyn chronous parallelism with low bu er space cost The imple mentation of this code called S uses supernodal matrix multiplication which retains the BLAS level e ciency and avoids unnecessary arithmetic operations The experiments show that S improves our previous code substantially and can achieve up to GFLOPS on Cray T E MHz nodes which is the highest performance reported in the lit erature

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Postordering and Static Symbolic Factorization for Parallel Sparse LU

In this paper we present several improvements of widely used parallel LU factorization methods on sparse matrices. First we introduce the LU elimination forest and then we characterize the L, U factors in terms of their corresponding LU elimination forest. This characterization can be used as a compact storage scheme of the matrix as well as of the task dependence graph. To improve the use of B...

متن کامل

Efficient Sparse LU Factorization with Lazy Space Allocation

Static symbolic factorization coupled with D supernode partitioning and asyn chronous computation scheduling is a viable approach for sparse LU with dynamic partial pivoting Our previous implementation called S uses those techniques and achieves high giga op rates on distributed memory machines This paper studies the space requirement of this approach and proposes an optimization strategy calle...

متن کامل

Eecient Sparse Lu Factorization with Partial Pivoting on Distributed Memory Architectures

A sparse LU factorization based on Gaussian elimination with partial pivoting (GEPP) is important to many scientiic applications, but it is still an open problem to develop a high performance GEPP code on distributed memory machines. The main diiculty is that partial pivoting operations dynamically change computation and nonzero ll-in structures during the elimination process. This paper presen...

متن کامل

Parallel Sparse LU Factorization with Partial Pivoting on Distributed Memory Architectures

Gaussian elimination based sparse LU factorization with partial pivoting is important to many scientiic applications, but it is still an open problem to develop a high performance sparse LU code on distributed memory machines. The main diiculty is that partial pivoting operations make structures of L and U factors unpredictable beforehand. This paper presents an approach called S for paralleliz...

متن کامل

S+: Efficient 2D Sparse LU Factorization on Parallel Machines

Static symbolic factorization coupled with supernode partitioning and asynchronous computation scheduling can achieve high giga op rates for parallel sparse LU factorization with partial pivoting This paper studies properties of elimination forests and uses them to optimize supernode partitioning amalgamation and execution scheduling It also proposes supernodal matrix multiplication to speed up...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003